CDS

Accession Number TCMCG075C17848
gbkey CDS
Protein Id XP_007030499.2
Location complement(join(36436182..36436275,36436369..36436475,36436554..36436648,36436743..36436860,36436939..36437036,36437241..36437357,36437466..36437553,36437693..36437866,36437970..36438117,36438274..36438347,36438552..36438607,36439062..36439200,36439319..36439381,36439583..36439694,36439779..36439997,36440256..36440515))
Gene LOC18600133
GeneID 18600133
Organism Theobroma cacao

Protein

Length 653aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_007030437.2
Definition PREDICTED: phosphoribosylaminoimidazole carboxylase, chloroplastic [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category F
Description phosphoribosylaminoimidazole carboxylase
KEGG_TC -
KEGG_Module M00048        [VIEW IN KEGG]
KEGG_Reaction R04209        [VIEW IN KEGG]
KEGG_rclass RC00590        [VIEW IN KEGG]
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko00002        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
KEGG_ko ko:K11808        [VIEW IN KEGG]
EC 4.1.1.21        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko00230        [VIEW IN KEGG]
ko01100        [VIEW IN KEGG]
ko01110        [VIEW IN KEGG]
map00230        [VIEW IN KEGG]
map01100        [VIEW IN KEGG]
map01110        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGCTTCAAAAGAGCTCCAACGCAGTGTTTTCGAGCTCCTCTGAATCCTCCTCATTCTTTGCTTTCAGACCATCTCAGTCTCTTTTACCGCCGCCTACACCTCTCTCACTCCGCTTTTTCTCCATGACCGCCGACGATGACCAACCCCACCACCGCAAGCTTCACCTGCGCCACTCTTCTACTTCTTCTTTCAAGCTCCTCAATCCCGTCCTCGCTTGCCGCGCCTCACCTGACTCTCACCAGACCAGCTCCTCTCTCAGGAATGATGATGGCTCCCCAGTTCATGGACTATCCGAAACGATTGTTGGCGTGTTAGGAGGAGGCCAATTGGGTCGTATGCTATGCCAAGCAGCTTCAAAAATGGCCATTAAAGTCATGGTTTTGGACCCTTCAGAGAATTGCCCAGCTAGTGCCCTCGCTTATGATCATATGGTTGGGAGCTTCGATGACAGCGCTACTGTTCAAGAATTTGCAAAAAGATGTGGAGTTTTGACGGTTGAAATTGAACATGTGGATGTTGCCACTCTAGAGAGGCTTGAGCAACAAGGAGTGGATTGCGAACCTAGAGCTTCTACCATTCGAATTATCCAAGATAAATATCTCCAGAAAGTTCATTTTTCTCAGCATGCCATTCCACTTCCTGAGTTTATGGAGATTGATGATCTTGAAGGAGCCAAGAGAGCAGGTGACCTATTTGGCTATCCTCTTATGATAAAGAGCAAGAGGTTAGCTTATGATGGGCGTGGAAATGCTGTTGCGAAGAGTGAAGAGGAGCTTCCTTCTGCCGTATCTGCTCTTGGTGGATTTGGTCGTGGTTTGTATGTTGAGAAATGGGCTCCTTTTGTAAAGGAGTTGGCAGTTATTGTAGCTAGAGGAAGAGACAACTCTATCTTGTGCTATCCAGTTGTTGAAACTATTCACAAGGAAAACATATGTCACATCGTTAAGGCACCTGCTGATGTGCCATGGAAGATCAGGAAACTTGCAAATGATGTTGCATATAAAGCTATTAGTTCATTAGAAGGTGCTGGTGTCTTTGCAGTGGAGTTGTTTTTGACGAAGGATGGCCAGATTCTTCTAAATGAAGTAGCTCCCAGACCTCATAATAGTGGTCATCACACAATTGAGTCCTGCTATACATCACAATTTGAACAACATTTACGGGCTGTTGTTGGTCTTCCTCTTGGTGATCCATCCATGAAAACTCCAGCTGCTATCATGTACAATCTTCTGGGTGAGGATGAGGGGGAACCTGGTTTCAAAATGGCTCATCAACTGATAGCAAGGGCACTGGAGATTCCAGGGGCTACTGTTCATTGGTATGATAAGCCAGAAATGCGAAAGCAAAGAAAGATGGGTCATATAACTCTTGTTGGCCCTTCTATGGGTGTTGTGGAAGCACGACTGAATTCAATGCTGAAGGAAGAAGTGTCTGAAAATCAGAATGAAGTTTCACCACGTGTTGGGATTATAATGGGATCTGATTCAGATCTTCCAGTAATGAAGGATGCTGCAAGAATCTTAGATATGTTTGGTGTGTCTTGTGAGGTTAGAATAGTCTCAGCGCACCGAACCCCTGAACTGATGTTTTCTTATGCCTCCTCTGCTCGGGAGCGAGGAATTCAGGTTATCATTGCTGGCGCTGGTGGTGCAGCTCACTTACCAGGTATGGTAGCTGCACTCACACCGTTACCTGTTATTGGTGTCCCAGTCCGGGCTTCTACATTGGATGGAATAGATTCACTCCTGTCAATAGTGCAGATGCCAAGGGGTGTCCCAGTTGCGACAGTTGCAGTAAACAATGCTACTAATGCAGGATTGCTTGCAGTACGGATGTTGGGAGTTGGTGATGCTGATTTATTGGCAAGAATGAGTCAGTATCAAGAAGACACAAGGGACGATGTCTTGACAAAAGCCCAAAGGCTACAAAACAATGGTTGGGAAGCTTATTTAAATCACTGA
Protein:  
MLQKSSNAVFSSSSESSSFFAFRPSQSLLPPPTPLSLRFFSMTADDDQPHHRKLHLRHSSTSSFKLLNPVLACRASPDSHQTSSSLRNDDGSPVHGLSETIVGVLGGGQLGRMLCQAASKMAIKVMVLDPSENCPASALAYDHMVGSFDDSATVQEFAKRCGVLTVEIEHVDVATLERLEQQGVDCEPRASTIRIIQDKYLQKVHFSQHAIPLPEFMEIDDLEGAKRAGDLFGYPLMIKSKRLAYDGRGNAVAKSEEELPSAVSALGGFGRGLYVEKWAPFVKELAVIVARGRDNSILCYPVVETIHKENICHIVKAPADVPWKIRKLANDVAYKAISSLEGAGVFAVELFLTKDGQILLNEVAPRPHNSGHHTIESCYTSQFEQHLRAVVGLPLGDPSMKTPAAIMYNLLGEDEGEPGFKMAHQLIARALEIPGATVHWYDKPEMRKQRKMGHITLVGPSMGVVEARLNSMLKEEVSENQNEVSPRVGIIMGSDSDLPVMKDAARILDMFGVSCEVRIVSAHRTPELMFSYASSARERGIQVIIAGAGGAAHLPGMVAALTPLPVIGVPVRASTLDGIDSLLSIVQMPRGVPVATVAVNNATNAGLLAVRMLGVGDADLLARMSQYQEDTRDDVLTKAQRLQNNGWEAYLNH